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DESCRIPTION 

A GENE ENCODING AN ENZYME FOR CATALYZING 
BIOSYNTHESIS OF LIGNAN, AND 
USE THEREOF 

TECHNICAL FIELD 

The present invention relates to enzyraes for catalyzing 
the biosynthesis of piperitol and sesamin in sesame, and to 
genes encoding the enzymes. The invention also relates to use 
of such enzymes and genes. 

BACKGROUND ART 

Sesame plants, generally known as sesame [Sesamum 
indicum), belong to family Pedaliaceae and genus Sesamum. 
Sesame is native to central Africa and is the oldest cultivated 
oil plant with the history of over 6000 years. Sesame has been 
cultivated in various parts of the world. 

Sesame seeds contain about 50% lipid and about 20% 
protein, in addition to various vitamins including vitamin Bl, 
B2, and E. The main lipid component of sesame seed is 
triglyceride whose chief constituents are oleic acid and 
linoleic acid. Sesame also contains secondary metabolites 
generally known as sesamin, sesamolin, and lignan, which are 
characteristic components of the sesame plant. 
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Previous research has revealed various physiological 
activities of sesamin, which are found to be effective in 
improving cholesterol metabolism, and liver and immune 
functions (see Publication 1, for example). A separation and 
purification method of sesamin from the sesame seeds or 
wrung residues of sesame seeds has already been put to 
actual applications (see Patent Publications 1 and 2, for 
example). Sesamin is also commercially available as a 
medicament that enhances the liver function by promoting 
alcohol metabolism, etc. Other than sesamin, sesame lignans 
(sesaminol, sesamolin, etc.) have also been reported to have 
various physiological activities (see Publication 2, for 
example). 

There has been some study on the biosynthesis of lignan 
(see Publication 3, for example). Fig. 1 is a schematic 
representation of a common biosynthesis pathway of lignan. 
The lignan is synthesized from a phenyl propanoid compound 
used as a starting material. In plants, lignan is believed to 
play role in the defense mechanism. As illustrated in Fig. 1, 
polymerization of conipheryl alcohol yields pinoresinol as the 
^first" lignan in the iDiosynthesis pathway. From pinoresinol, a 
wide variety of lignans are synthesized through distinct 
biosynthesis pathways of different plant species. 

In the biosynthesis of sesamin, as shown in Fig. 1, 
piperitol is synthesized by the enzymatic action of a piperitol 
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synthetase catalyzing (+)-pinoresinol, forming a methylene 
dioxybridge (circled in Fig- 1). Sesamin is synthesized by a 
sesamin synthetase forming another methylene dioxybridge in 
the piperitol. Experiments using membrane fractions of 
sesame seeds have shown that the enzymes catalyzing these 
reactions were two different kinds of cytochromes P450 (see 
Publication 4, for example). 

The formation of methylene dioxybridge is often seen in 
the biosynthesis of alkaloid or flavonoid. For example, 
membrane fractions from cultured cells of Eschscholtzia 
calfornica are known to include cytochromes P450 that 
catalyze the biosynthesis of (S)-cheilanthifoline from 
(S)-scoulerine, and (S)-stylopine from (S)-cheilanthifoline, by 
forming a methylene dioxybridge in these compounds (see 
Publication 5, for example). 

There have also been reported that membrane fractions 
from cultured cells of chickpea (Cicer arietinum) contain 
enzymes that catalyze the synthesis of Pseudobaptigenin and 
5*-hydroxy Pseudobaptigenin produced by formation of a 
methylene dioxybridge in calycosin and pratensein, 
respectively. These enzymes have been identified as 
cytochromes P450 (see Publication 6, for example). 

Others report the possibility that the 
deoxypodophyllotoxin 6-hydroxylase in cultured cells of Linum 
flavum may be cytochrome P450 (see Publication 7, for 
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example) . 

The cytochrome P450 has also been found as an enzyme 
involved in the biosynthesis of berberine, which is a 
benzylisoquinoline silkaloid, in which (S)-tetrahydroberberine 
is synthesized by forming a methylene dioxybridge in 
(S)-tetrahydrocolimt>amine. A gene that encodes this enzyme 
has been cloned from Coptis japonica (see Publication 8, for 
example). 

The cytochroines P450 that catalyze various types of 
reactions as above comprise a superfamily of diverse 
molecular species, which are categorized based on the 
homology of their amino acid sequences. Different molecular 
species of cytochrome P450 belong to the same family if their 
identity is 40% or greater, and to the same sub family if their 
identity is 55% or greater. In the notation used in this 
classification, numbers denote family, and alphabets denote 
sub family (see Publication 9, for example). The tree diagram 
shown in Fig. 3 represents families and their interrelations. 
For example, the cytochrome P450 involved in the 
biosynthesis of berberine has been categorized as "CYP719.'' 

A single plant species include several hundred molecular 
species of cytochrome P450. However, as shown in Fig. 4, only 
a few of them have been identified based on their biochemical 
and physiological functions. 

As a sesame-derived cytochrome P450, a gene 
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(AY065995) that encodes p-coumarate 3-hydroxylase has been 
cloned, though it does not directly relate to the synthesis of 
sesamin or piperitol concerning the present invention. 

Cloning of cytochrome P450 genes and their functional 
analysis have also been reported for various species of other 
organisms, as noted below. 

For example, some of the cloned genes include: 

A gene coding for petunia-derived flavonoid 3% 5' 
hydroxylase (F3% 5'H) (see Publication 10, for example); 

A gene coding for flavonoid 3 '-hydroxylase (F3'H: 
CYP75B) (see Publication 11, for example); 

A gene coding for sweetroot-derived (2S)-flavanon 
2-hydroxylase (F2H: CYP93B1) (see Publication 12, for 
example); 

A gene coding for 2-hydroxy-isoflavanon synthetase (IFS: 
CYP93C2) (see Publication 13, for example); and 

A gene coding for isoflavon 2 '-hydroxylase (I2'H: 
CYP81E1) (see Publication 14, for example). 

In addition, a gene coding for flavon synthetase II 
(FNSII: CYP93B3) has also been cloned from Antirrhinum 
majus, using I2'H (see Publication 15, for example). 

Amino acid sequences of various cytochromes P450 that 
belong to the CYP81 family are found in 
Http:// drnelson.utmem.edu/CytochromeP450.htmL Among 
different functions of these cytochromes P450, Helianthus 
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tuberosus-derived CYP81B1 is known to catalyze the 
hydrogenation of fatty acids (see Publication 16, for example). 
It is also known that the enzyme 12 'H belongs to CPY81E. 

However, none of these cytochromes P450 is involved in 
the formation of methylene dioxybridge. 

As a gene that encodes an enzyme involved in the 
biosynthesis of lignan, a gene encoding a dirigent protein 
involved in thte synthesis of pinoresinol in Forsythia 
intermedia has been reported (see Publication 17, and Patent 
Publication 3, for example). There have also been reports on a 
gene encoding a Pinoresinol-Lariciresinol reductase in 
Forsythia intermedia (see Publication 18, and Patent 
Publication 3, for example), and a gene encoding a 
Pinoresinol-Lariciresinol reductase in Thuja plicata (see 
Publication 19, for example). In other reports, a recombinant 
secoiso lariciresinol dehydrogenase and its use are discussed 
(see Publication 20, for example). 

[Patent Publication 1] 

Japanese • laid-open publication No. 139579/2001 
(published on May 22, 2001) 
[Patent Publication 2] 

Japanese laid-open publication No. 7676/1998 
(published on January 13, 1998) 
[Patent Publication 3] 

Japanese PCT laid-open publication No. 507931/2001 
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[Patent Publication 4] 

Japanese PCT laid-open publication No. 512790/2002 
(published on May 8, 2002} 
[Publication 1] 

Mitsuo, Namiki, "Sesame, science and its functions," 
published by Maruzen Planet 
[Publication 2] 

Publication of Japan Society for Bioscience, 
Biotechnology, and Agrbchemistry, 76 805-813 2002 
(Publication 3] 

Lignans: biosynthesis and function. Comprehensive 
natural products chemistry vol. 1. 640-713, 1999 
[Publication 4-] 

Phytochemisity, 49, 387, 1998 
[Publication 5] 

Phytochemisity, 30, 2953, 1991 
[Publication 6] 

Phytochemisity, 41, 457, 1996 
[Publication Z] 
Planta, 214, 288, 2001 
[Publication 8] 

J Biol Chem. 2003, May 5 [Epub ahead of print] 
[Publication 9] 

Nelson et al. Pharmacogenetics 6, 1-42, 1996 
[Publication L O] 
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Nature 366 276-279, 1993 

[Publication 11] 

Plant J. 19, 441-451, 1999 

[Publication 12] 

FEES Letters, 431,287, 1998 

[Publication 13] 

Plant Physiology, 121, 821, 1999 
[Publication 14] 

Biochemical and Biophysical Research Communications, 
251, 67, 1998 

[Publication 15] 

Plant and Cell Physiology, 40, 1182, 1999 

[Publication 16] 

J. Biol. Cliem. 273, 7260, 1998 

[Publication 17] 

Plant Physiol. 123, 453, 2000 

[Publication 18] 

J. Biol. Ctiem. 271, 29473, 1996 
[Publication 19] 
J. Biol. Chiem. 271, 618, 1999 
[Publication 20] 

J. Biol. Chem. 2001 Apr 20;276(16): 12614-23, Epub 
2001 Jan 18 

As described above, sesamin has a variety of 
physiological activities, which are known to be effective in 
improving various deficiencies. However, conventional 
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sesamin production solely relied on a method using sesame 
seeds alone. In other words, sesamin production is completely 
dependent on sesame seeds. Consequently, it has been 
difficult to improve productivity or reduce the cost of sesamin 
production. 

The problem can be solved effectively by genetic 
engineering tecliniques. However, to this date, no enzyme has 
been purified for the two kinds of cytochromes P450 that are 
known to be involved in the biosynthesis of sesamin in 
sesame seeds, a.nd no gene for encoding these enzymes has 
been cloned. Th.e situation is the same for the cytochromes 
P450 that form the methylene dioxybridge in different species 
of other organisms, or for the other kinds of cytochromes 
P450. There have been cloned genes encoding enzymes 
involved in the biosynthesis of lignan in different species of 
other organisms. However, none of them is involved in the 
synthesis of sesamin and/ or piperitol. 

Some of the genes derived from sesame seeds have been 
cloned, examples of which include AF240004, AF240005, and 
AF240006 encoding globulins as deposit proteins of seeds. 
Other examples include genes or enzymes involved in the 
synthesis or storage of lipids, including oleosin (J. Biochem. 
122: 819-24, 1997), an acyl carrier protein desaturase (Plant 
Cell Physiol, 1996, 37, 201-5), Steroleosin (Plant Physiol. 
2002, 128: 1200-11), and a fatty acid unsaturase (Plant Sci. 
161 935-941 (2001)). However, none of these genes or 



wo 2005/030944 



PCT/JP2004/014696 



- 10 - 

enzymes is involved in the synthesis of lignan. 

In other words, there has not been found a gene that 
encodes an enzyme involved in the synthesis of lignan. 
Accordingly, there is a strong need for identification of such 
enzymes, and genes encodinjg these enzymes. 

The present invention was made in view of the foregoing 
problems, and an object of the invention is to provide a 
producing method of sesamin and/ or piperitol with a gene 
encoding a sesame-derived enzyme, using, for example, 
recombinant organisms. The object is achieved by identifying 
a gene encoding an enzyme that catalyzes the formation of 
methylene dioxybridge between the hydroxyl group and methyl 
group of lignan, or more preferably a gene encoding an 
enzyme that catalyzes a reaction forming piperitol from 
pinoresinol, and /or a reaction forming sesamin from piperitol. 

DISCLOSURE OF INVENTION 

The inventors of the present invention diligently worked 
to solve the foregoing problems. In achieving the object, a 
group of sesame-derived cytochrome P450 genes (hereinafter 
"SiP genes'*) was obtained from a cDNA library of sesame seed, 
and these gen.es were expressed in yeasts. Microsome 
fractions were collected from the recombinant yeasts, and 
were allowed to react with pinoresinol or piperitol. The 
presence or absence of piperitol from pinoresinol, or sesamin 
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from piperitol was examined by HPLC analysis after the 
reaction. From the result of HPLC analysis, a protein, or a 
gene encoding it, that catalyzes the reaction producing 
piperitol from pinoresinol, and sesamin from piperitol was 
identified. 

Specifically, the present invention provides industrially 
useful substances and methods, as set forth in (1) through 
(21) below. 

(1) A gene encoding a protein that catalyzes biosynthesis 
of piperitol and/ or sesamin. (A gene encoding a protein that 
catalyzes the biosynthesis of piperitol from pinoresinol, 
and/ or sesamin from piperitol.) 

(2) A gene encoding a protein that catalyzes a reaction 
forming a methylene dioxybridge in pinoresinol and/ or 
piperitol. 

(3) A gene encoding a protein that catalyzes biosynthesis 
of piperitol and/ or sesamin, and that consists of (a) an amino 
acid sequence of SEQ ID NO: 1, 64 or 78, or (b) an amino acid 
sequence that has been modified by substitution, deletion, 
insertion, and./ or addition of one or more amino acids of SEQ 
ID NO: 1, 64 or 78. 

(4) A gene encoding a protein that catalyzes biosynthesis 
of piperitol and/ or sesamin, and that consists of an amino 
acid sequence which is at least 50% homologous to an amino 
acid sequence of SEQ ID NO: 1, 64 or 78. 
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(5) A gene including a base sequence of SEQ ID NO: 2, 
65 or 79 as an open reading frame region. 

(6) A gene encoding a protein that catalyzes biosynthesis 
of piperitol and /or sesamin, and hybridizing under stringent 
conditions with (a) a polynucleotide consisting of a base 
sequence of SEQ ID NO: 2, 65 or 79, (b) a polynucleotide 
encoding a protein consisting of an amino acid sequence of 
SEQ ID NO: 1, 64 or 78, or (c) a fragment of the 
polynucleotide (a) or (b). 

(7) A gene as set forth in any one of (1) through (6), 
which is derived from sesame. 

(8) A protein encoded by a gene as set forth in any one of 
(1) through (7). 

(9) A protein catalyzing biosynthesis of piperitol and/ or 
sesamin, and consisting of (a) an amino acid sequence of SEQ 
ID NO: 1, 64 or- 78, or (b) an amino acid sequence that has 
been modified by substitution, deletion, insertion, and/ or 
addition of one or more amino acids of SEQ ID NO: 1, 64 or 
78. 

(10) An antibody that recognizes a protein as set forth in 
(8) or (9). 

(11) A recombinant expression vector including a gene of 
any one of (1) ttirough (7). 

(12) A transformant comprising a recombinant 
expression vector including a gene of any one of (1) through 
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(7). 

(13) A producing method of a protein, comprising the 
steps of: 

incubating or growing a transformant of (12); and 
obtaining from the transformant a protein that catalyzes 
biosynthesis of piperitol and/ or sesamin. 

(14) A plant, its offspring, and a tissue of the plant and 
its offspring, into which a gene of any one of (1) through (7) 
has been introduced. 

(15) A producing method of piperitol and/ or sesamin, 
comprising the step of using a gene of any one of (1) through 
(7), or a protein of (8) or (9). 

(16) A producing method of a transformant containing a 
large amount of lignan, comprising the step of using a gene of 
any one of (1) through (7). 

(17) A producing method of a plant containing a large 
amount of piperitol and/ or sesamin, comprising the step of 
using a gene of any one of (1) through (7). 

(18) A producing method of a transformant containing a 
small amount of lignan, comprising the step of using a gene of 
any one of (1) through (7). 

(19) A producing method of a plant containing a small 
amount of piperitol and/ or lignan, comprising the step of 
using a gene of any one of (1) through (7). 

(20) A method of cultivating sesame, comprising the step 
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of using a gene of any one of (1) through (7). 

(21) A gene detecting device comprising a polynucleotide 
probe whose base sequence is at least part of a base sequence 
of a gene set forth in any one of (1) through (7), 

BRIEF DESCRIPTION OF DRAWINGS 

Fig. 1 is a schematic diagram showing a typical 
synthesis pa.th of sesame lignan, wherein 1 is conipheryl 
alcohol, 2 is pinoresinol, 3 is piperitol synthetase, 4 is 
piperitol, 5 is sesamin synthetase, and 6 is sesamin. 

Fig. 2a through Fig. 2f are diagrams showing results of 
HPLC measuring activity of an enzyme encoded by SiP189 
gene. 

Fig. 3 is a tree diagram obtained from the analysis of the 
primary structure of amino acids encoded by cytochrome 
P450. 

Fig. 4 is a diagram showing a result of homology search 
with a protein encoded by SiP189 gene. 

Fig. 5 is a diagram showing a result of Southern analysis 
on SiP189 gene. 

Fig. 6a through Fig. 6d are diagrams showing a result of 
HPLC analysis measuring activity of a pjrotein encoded by 
SrSiPlBQ gerxe. 

Fig. 7 is a diagram showing a result of expression 
analysis on SiP gene in transformant tabaco. 
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Fig. Set through Fig. 8d are diagrams showing a result of 
functional a.nalysis on SiP protein in plant cells. 

Fig. 9A through Fig. 9H are diagrams showing a result of 
analysis on a promoter region of SiP 189 gene in search for an 
expression regulatory element. 

Fig. 10 is a diagram showing expression regulatory 
elements present in the promoter region of SiP189 gene. 

Fig. 1 lA through Fig. IIH are diagrams showing a result 
of analysis on a promoter region of SrSiP189 gene in search 
for an expression regulatory element. 

Fig. 12A through Fig. 12C are diagrams representing 
homology between SiP189 gene and SrSiPgene. 

BEST MODE FOR CARRYING OUT THE INVENTION 

The following will describe an embodiment of the present 
invention. It should be appreciated that the present invention 
is not limited in any way by the following descriptions. 

(1) Genes according to the present invention, and 
structures of proteins encoded by the genes 

A gene according to the present invention encodes a 
protein for catalyzing the biosynthesis of piperitol and/ or 
sesamin. As used herein, the "biosynthesis of piperitol and/ or 
sesamin* means biosynthesis of piperitol from pinoresinol, 
and/ or sesa.min from piperitol. More specifically, it refers to a 
reaction forming a methylene dioxybridge in the pinoresinol 
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and/or piperitoL In the embodiment, the invention will be 
described through gene SiP189 (with a base sequence of SEQ 
ID NO: 2) that encodes a seed-derived protein for catalyzing 
the biosynthesis of piperitol and sesamin. In the present 
invention, the open reading frame is the region from the start 
codon to the end codon, excluding the end codon. 

(1-1) Genes according to the present invention 
As used herein, the term "gene'* is interchangeable with 
''polynucleotide", ^'nucleic acids'*, or "nucleic acid molecule*'. 
That is, the meaning of "gene" includes a polymer of 
nucleotides. Further, as used herein, the term "base 
sequence" is interchangeable with "nucleic acid sequence" or 
"nucleotide sequence", and it is represented by a sequence of 
deoxyribonucleotides (denoted by A, G, C, and T). 

A gene of the present invention may code for the amino 
acid sequence of SEQ ID NO: 1, 64 or 78, for example. It 
should t>e noted here that proteins are generally known to 
retain their enzyme activities even if their amino acid 
sequences were modified by addition, deletion, and/ or 
substitution of several amino acids. In this connection, a gene 
of the present invention, which encodes a protein that 
catalyzes the biosynthesis of piperitol and sesamin, may 
encode (a) a protein consisting of the amino acid sequence of 
SEQ ID NO: 1, 64 or 78, or (b) a protein consisting of the 
amino a.cid sequence modified by substitution, deletion. 



wo 2005/030944 PCT/JP2004/014696 

- 17 - 

insertion, and/ or addition of one or more amino acids of SEQ 
ID NO: 1, 64 or 78 yet retains its catalytic action in the 
biosynthesis of piperitol and/ or sesamin. For example, the 
present invention includes a gene having an open reading 
frame (ORF) with the base sequence of SEQ ID NO: 2, 65 or 79. 
As will be described later in Examples, a gene according to 
the present invention may also encode a cytochrome P450 
protein. 

As used herein, the "substitution, deletion, insertion, 
and/ or addition of one or more amino acids" means the 
substitution, deletion, insertion, and/ or addition of preferably 
no more than 10, more preferably no more than 7, and further 
preferably no more than 5 amino acids, as enabled by a 
conventional mutant protein producing method (e.g., 
site-directed mutagenesis inducing method, Hashimoto-Gotoh, 
Gene 152, 271-275 (1995)). Therefore, it can be said that 
protein (b) is encoded by a gene that encodes a mutant of 
protein (a). As used herein, "mutant protein** generally refers 
to mutant proteins that are artificially prepared by a 
conventional mutant protein producing method. However, the 
mutant protein may be isolated and purified from a natural 
source as well. 

A gene of the present invention is not limited to 
double-stranded DNA, and may be the sense strand or 
anti-sense strand of double-stranded DNA or RNA. The 
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anti-sense strand may be used as a probe or anti-sense 
medicament. For DNA, cDNA or genomic DNA obtained by 
cloning techniques, chemical synthesis techniques, or a 
combination of these different techniques may be used. 
Further, a gene according to the present invention may 
include a sequence of an untranslated region (UTR), or a 
vector sequence (including expression vector sequence), in 
addition to the sequences coding for the amino acids of 
proteins (a) and (b). 

Further, a gene of the present invention may encode a 
protein that has no less than 20%, preferably no less than 
50%, and more preferably no less than 60% or 70% homology 
to the amino acid sequence of SEQ ID NO: 1, 64 or 78, and 
that catalyzes the biosynthesis of piperitol and/ or sesamin. 
The gene may be used to control a lignan level or cultivate 
plants. As the term is used herein, ''homology*' refers to a 
proportion of matched amino acid sequences. A higher 
homology means a closer relation between the amino acid 
sequences compared. 

Further, a gene of the present invention may include a 
polynucleotide consisting of a base sequence of SEQ ID NO: 2, 
65 or 79, a polynucleotide coding for a protein consisting of 
an amino acid sequence of SEQ ID NO: 1, 64 or 78, or 
fragments of these polynucleotides. For example, a gene of the 
present invention may hybridizes under stringent conditions 
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with a polynucleotide having a base sequence coding for six or 
more amino acids (i.e., 18 bases), and may encode a protein 
that catalyzes the biosynthesis of piperitol and/ or sesamin. 
As used herein, ''under stringent conditions'* means that 
hybridization takes place only when there is at least 80% 
identity, preferably at least 95% identity, and more preferably 
at least 97% identity. Specifically, hybridization may take 
place under 5 x SSC at 50°C, for example. 

Hybridization may be carried out by a conventional 
method, as described in J. Sambrook et al. Molecular Cloning, 
A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory 
(1989), for example. Generally, the level of stringency 
increases with increase in temperature and/ or decrease in 
salt concentration (more difficult to hybridize). A suitable 
hybridization temperature depends on the base sequence or 
the length of the base sequence. For example, when a DNA 
fragment consisting of 18 bases coding for 6 amino acids is 
used, temperatures of not more than 50°C are preferable. 

The hybridization may select a gene derived from a 
natural source, for example, such as plants belonging to 
family Bryophyta. Genes derived from other natural sources 
may be used as well. Further, a gene selected by hybridization 
may be cDNA or genomic DNA. 

(1-2) Proteins according to the present invention 

A protein according to the present invention is encoded 
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by a gene of the present invention described in (1-1) above. 
Encoded by a gene of the present invention, the protein 
catalyzes the biosynthesis of piperitol from pinoresinol, or 
sesamin from piperitol. The present invention also includes a 
protein that catalyzes the reaction forming a methylene 
dioxybridge in the pinoresinol or piperitol- For example, a 
protein of the present invention may include (a) the amino 
acid sequence of SEQ ID NO: 1, 64 or 78, or (b) the amino 
acid sequence modified by substitution, deletion, insertion, 
and/or addition of one or more amino acids of SEQ ID NO: 1, 
64 or 78 yet retains its catalytic action in the biosynthesis of 
piperitol and/ or sesamin. 

Using a gene of the present invention described in (1-1) 
above, a protein according to the present invention may be 
obtained by introducing the gene into a host cell and 
expressing the gene therein. Alternatively, a protein may be 
obtained by isolating and purifying from cells or tissues. 
Further, a protein according to the present invention may be 
a fusion protein with other proteins. Further, a protein 
according to the present invention may be chemically 
synthesized. 

It should be noted that a protein according to the 
present invention is not limited to polypeptides as chains of 
amino acids forming peptide bonds. The protein may be a 
complex protein of a structure other than the polypeptide, or 
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a protein with additional polypeptides. Examples of additional 
polypeptides include various epitopes such as His, Myc, and 
Flag, tagged to a protein of the present invention. 

(2) Method of obtaining genes and proteins of the 
present invention 

A method of obtaining (producing) a gene and protein of 
the present invention is not particularly limited. The following 
describes some representative methods. 

(2-1) Method of obtaining genes 

For genes with natural base sequences, a gene according 
to ttie present invention may be obtained by screening of 
cDNA libraries for example, as will be described in detail later 
in Bx:amples. 

Alternatively, a gene according to the present invention 
may be obtained with the use of amplification means such as 
PGR. For example, PGR may be carried out with primers 
designed according to the sequences at the 5' end and 3' end 
of a cDNA sequence (or the complementary sequences thereof) 
of a gene according to the present invention, using the 
genomic DNA (or cDNA) as a template, for example. The PGR 
amplifies a region of DNA flanked by the primers, thereby 
obtaining a large amount of DNA fragments containing a gene 
of the present invention. 

In another method, a polynucleotide having a desired 
sequence is synthesized by conventional chemical synthesis 
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procedures, based on gene sequence information. 

DNTA encoding an enzyme having a modified sequence of 
amino acid may be synthesized by PGR or ordinary 
site-directed mutagenesis, using DNA with a natural base 
sequence. For example, in order to obtain modified DNA 
fragments of interest, natural cDNA or genomic DNA is 
treated with restriction enzymes to obtain DNA fragments. 
Using tine DNA fragment as a template, PGR or site-directed 
mutagenesis is carried out with primers that have been 
mutated to desired sequences. The mutated DNA fragments 
are then joined to polynucleotides (DNA) which encode an 
enzyme of interest and from which regions of the 
corresponding DNA fragments have been removed. For 
example, DNA that encodes an enzyme with a shorter amino 
acid sequence may be obtained with the use of certain 
restriction enzymes that cuts DNA encoding an amino acid 
sequence (e.g., full length amino acid sequence) longer than 
an amino acid sequence of interest. If the resulting DNA 
fragments do not encode the entire amino acid sequence of 
the enzyme of interest, a DNA fragment that makes up the 
deficient sequence may be synthesized and joined. 

The resulting gene is expressed in the gene expression 
system of Escherichia coli or yeasts to obtain a protein it 
encodes . The enzyme activity of the protein may be evaluated 
to confirm whether the gene has encoded a protein that 
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catalyzes the biosynthesis of piperitol from pinoresinol, or 
sesaxnin from piperitol. Further, with the gene introduced into 
a cell, the protein catalyzing the biosynthesis of piperitol from 
pinoresinol, or sesamin from piperitol can be obtained. 

Further, with an antibody against a protein consisting of 
the a.mino acid sequence of SEQ ID NO: 1, 64 or 78, the gene 
encoding the enzyme for catalyzing the synthesis of piperitol 
and sesamin can be also cloned from other organisms. 

(2-2) Method of obtaining proteins 

A method of obtaining (producing) a protein of the 
present invention is not particularly limited. For example, 
simple purification from cells or tissues expressing the 
protein may be used. The cells or tissues expressing a protein 
of th.e present invention may be self-generating. For example, 
cells or tissues infected with a recombinant baculovirus may 
be used. Purification is not limited to a particular method, 
and may be carried out as follows. After a transformed host 
by an expression vector including a gene of the present 
invention is cultured, cultivated, or bred, a protein of interest 
may be collected and/ or purified from the culture according 
to ordinary procedures, including filtration, centrifugation, 
cell- disruption (cell lysis), gel chromatography, and ion 
exchange chromatography, for example. 

Another way to obtain a protein of the present invention 
is to use recombinant techniques. As one example, a gene 
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according to the present invention is inserted in a carrier 
such as a vector, introduced into host cells by a conventional 
method., and expressed therein. The translated protein is 
purified. Specific methods of gene transfer (transformation) 
and gene expression will be described later. 

Tlie protein for catalyzing the synthesis of piperitol and 
sesamin can also be obtained with the use of an antibody 
against a protein consisting of the amino acid sequence of 
SEQ ID NO: 1, 64 or 78. Further, with the antibody, the 
enzyme catalyzing the biosynthesis of piperitol and sesamin, 
and th.e gene encoding the enzyme can also be cloned from 
other organisms. 

Note that, in introducing the foreign gene into a host, 
various types of hosts and expression vectors can be used. 
Here, the expression vectors have been integrated with a 
promoter that functions to express the gene in the host. A 
type of host or expression vector used is suitably selected 
according to intended use. Different purification methods are 
available for the product proteins, depending on the type of 
host u.sed and the characteristics of the protein of interest. 
Regardless of the type of purification method used, use of a 
tag or the like helps purification of the protein of interest 
with relative ease. 

A. method of preparing a mutant protein is not 
particialarly limited either. For example, a mutant protein may 
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be generated by introducing a point mutation in the base 
sequence using conventional mutant protein inducing 
methods, including a site-directed mutagenesis (see 
Hastiimoto-Gotoh, Gene 152, 271-275 (1995), for example) or 
PGR. Alternatively, a mutant protein may be generated by a 
method in which mutant strains are produced by insertion of 
transposons. With any of these methods, the base sequence of 
cDNA encoding the protein (a) can be altered by substitution, 
deletion, insertion, and/or addition of one or more bases, so 
as to produce a mutant protein. For the preparation of 
mutant proteins, a commercially available kit may be used as 
well . 

Other than the foregoing methods, a protein according to 
the present invention may be obtained by chemical synthesis 
using a commercially available peptide synthesizer, for 
example. Alternatively, a protein according to the present 
invention may be synthesized from a gene according to the 
present invention, using a cell-free protein synthesis solution 
as described in Proc. Natl. Acad. Sci. USA, 78, 5598-5602 
(1981), J. Biol. Chem., 253, 3753-3756 (1978). 

(3) Antibody according to the present invention 
An antibody according to the present invention 
(polyclonal antibody or monoclonal antibody) is obtained 
according to a conventional method, by using a protein of the 
present invention described in Section (1-2) above as an 
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antigen (for example, protein (a), (b), or fragments of these 
proteins). Examples of the conventional method include 
Harlow et al.; Antibodies: A laboratory manual (Cold Spring 
Harbor Laboratory, New York (1988), and Iwasaki et al.; 
Monoclonal antibody, hybridoma and ELIZA, Kodansha (1991). 
The antibody may be used in the detection and/ or 
measurement of a protein according to the present invention. 

(4) Recombinant vectors according to the present 
invention 

A recombinant expression vector according to the 
present invention include a gene of the present invention 
described in Section (1-1) above (e.g., gene encoding protein 
(a) or (b)). A recombinant expression vector that has 
incorporated cDNA is one example of a recombinant 
expression vector of the present invention. The recombinant 
expression vector may be generated using a plasmid, phage, 
or cosmid as non-limiting examples. Further, the recombinant 
expression vector may be generated by a convention method. 

The type of vector is not particularly limited as long as it 
is expressed in a host cell. Specifically, such vectors are 
prepared by incorporating a gene of the present invention in a 
plasmid along with a promoter sequence that has been 
suitably selected to ensure gene expression. The promoter 
sequence depends on the type of host cell. 

Various markers may be used to confirm whether a gene 
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of the present invention has been introduced in a host cell, or 
whether the protein encoded by * the gene has been 
successfully expressed in the host cell. The marker (e.g., a 
gene lacking in the host cell) is integrated with a carrier, 
such as a plasmid, together with a gene of the present 
invention, and is introduced into the host cell as an 
expression vector. Successful uptake of the gene of the 
present invention may be confirmed by checking the 
expression of the marker in the host cell that has 
incorporated the expression vector. Alternatively, a protein 
according to the present invention may be expressed in the 
form of a fusion protein in the host cell. For example, a 
proteirx according to the present invention may be expressed 
as a fusion protein with a green fluorescence protein (GFP) 
derived, from Aequorea victoria. In this case, the GFP is used 
as a marker. 

The host cell is not particularly limited, and various 
conveixtionally available cells may be used. Specifically, as a 
prokaryotic host, bacteria belonging to genus Escherichia (for 
example, Escherichia coZi), or genus Bacillus (for example, 
BacillvLs subtilis) may be used. As a eukaryotic host, lower 
eukaryotes (for example, eukaryotic bacteria including true 
fungi (e.g., yeasts, filamentous fungi)) may be used. Yeasts 
may be microorganisms that belong to genus Saccharomyces 
(for example, Saccharomyces cerevisiae). Examples of molds 
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(filamentous fungi) include microorganisms that belong to 
genus Aspergillus (for example, Aspergillus oryzae, Aspergillus 
niger), or genus Penicillum. Further, the host cell may be an 
animal cell or a plant cell. The plant cell may be obtained 
from various plants, including those belonging to family 
Pedaliaceae or Poaceae. As for animal cells, animal cell lines 
obtained from mice, hamsters, monkeys, or human may be 
used, for example. Insect cells (for example, silkworm cell or 
adult silkworm itself) may be used as well. 

A recombinant expression vectors according to the 
present invention includes expression control regions (for 
example, promoter, terminator, and/or origin of replication), 
which vary depending on the type of target host cell. For 
bacteria expression vectors, ordinary promoters, for example, 
such as trc promoter, tac promoter, or lac promoter are used. 
As yeast promoters, a glycelaldehyde triphosphate 
dehydrogenase (GAPDH) promoter, or PH05 promoter is used, 
for example. Amylase promoter or trpC promoter may be used 
as a filamentous' fungi promoter, for example. For animal cell 
hosts, viral promoters (for example, SV40 early promoter, 
SV40 late promoter) are used. The expression vector may be 
prepared according to ordinary method, for example, using 
restriction enzymes and/ or ligase. Transformation of hosts by 
the expression vector may also be carried out according to 
ordinary method. 
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A method of introducing the expression vector into the 
host cell (method of transformation) is not particularly limited 
and various conventional methods may be used, including an 
electroporation method, calcium phosphate method, liposome 
method, DEAE dextran method, and micro injection method, 
for example. 

(5) Transformants according to the present invention 

A transformant according to the present invention has 
incorporated a gene of the present invention described in 
Section (1-1) above (for example, a gene encoding protein (a) 
or (b)). As used herein, "incorporation of a gene" means 
transfer of a gene into a target cell (host cell) by conventional 
genetic engineering techniques. Further, as the term is used 
herein, ''transformant'' means not only cells, tissues, or 
organs, but also living organisms themselves. 

A transformant according to the present invention may 
be prepared (produced) using a recombinant expression vector 
of the present invention described in Section (4) above. The 
organisms to be transformed are not particularly limited, and 
may be microbes or plants as exemplified above. With a 
suitable promoter or vector, animals and insects can be 
transformed as well. 

In preferred embodyment, a sesame is used to generate a 
transformant according to the present invention. The method 
of generating a sesame transformant include a known method 
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described in , for example, T. Asamizu: Transformation of 
sesame plants using MAT vector system: introduction of fatty 
a.cid desaturase genes. Sesame Newsletter 16: 22-25 (2002). 

Further, as the term is used herein, "transformant** also 
includes plants into which genes of the present invention 
have been transferred, and their offspring having the same 
characteristics. Tissues of these plants also fall within the 
meaning of *transformant". 

(6) Gene detecting device according to the present 
invention 

A gene detecting device according to the present 
invention comprises a polynucleotide as a probe consisting of 
a base sequence, or its complementary sequence, which is at 
least a portion of a gene according to the present invention. A 
gene detecting device of the present invention can be used 
under a variety of conditions for the detection and/ or 
measurement of expression pattern of a gene of the present 
invention. 

One example of a gene detecting device according to the 
present invention is a DNA chip, in which a probe that 
specifically hybridizes with a gene of the present invention is 
immobilized on a substrate (carrier). As used herein ""DNA 
chip'* generally means a synthetic DNA chip that uses a 
synthetic oligonucleotide as a probe. However, it also includes 
an adhesion DNA microarray that uses cDNA as a probe 
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produced by PGR, for example. 

As used herein, the term "oligonucleotide'' means a 
chain of several to several ten nucleotides joined together, 
and it is used interchangeably with "polynucleotide". The 
oligonucleotide may be generated or chemically synthesized as 
a. fragment of a longer polynucleotide. 

The base sequence of the polynucleotide used as a probe 
may be determined by a conventional method of specifying a 
characteristic sequence of cDNA sequences. (For example, a 
SAGE (Serial Analysis of Gene Expression) method, as 
described in Science 276:1268, 1997; Cell 88: 243, 1997; 
Science 270: 484, 1995; Nature 389: 300, 1997; US patent No. 
5,695, 937 may be used.) 

The DNA chip may be made by a conventional method. 
For example, when a synthetic oligonucleotide is used, it may 
be synthesized on a substrate by a combination of 
photolithography and solid phase DNA synthesis technique. 
On the other hand, when the oligonucleotide is cDNA, it is 
stuck on a substrate using an array device. 

Further, as in common DNA chips, the accuracy of gene 
detection can be improved by placing a perfect-match probe 
(oligonucleotide) with a mismatch probe that has been 
prepared by substituting a single base of the perfect-match 
probe. Further, in order to detect different genes 
simultaneously, a DNA chip may be prepared in which 
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different types of oligonucleotides are immobilized on a single 
substrate. 

(7) Use (effectiveness) of genes and proteins according to 
the present invention 

The foregoing descriptions mainly discussed the enzyme 
that catalyzes the biosynthesis of piperitol and/ or sesamin in 
sesame. However, the present invention is not just limited to 
sesame-derived genes, and it also relates to use of the enzyme 
that catalyzes the biosynthesis of piperitol and/ or sesamin. 
The enzyme for catalyzing the biosynthesis of piperitol and/ or 
sesamin may be derived from plants, animals, or 
microorganisms. Regardless of where the enzyme derives from, 
the enzyme can be used to control lignan level as long as it 
has the enzyme activity in the biosynthesis of piperitol and /or 
sesamin. Further, the invention concerns plants with a 
controlled lignan level, and their offspring and tissues, 
obtained by transferring a gene that encodes an enzyme for 
catalyzing the biosynthesis of piperitol and/ or sesamin. The 
plant may be in the form of cut flowers. A gene of the present 
invention that encodes an enzyme for catalyzing the 
biosynthesis of piperitol and/or sesamin can be used to 
generate piperitol or sesamin, or suppress formation of 
piperitol or sesamin. With the current technology, it is 
perfectly possible to transfer the gene in plants and express it 
constitutively or tissue-specifically. . It is also possible to 
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suppress expression of a gene of interest using an anti-sense 
method, co-suppression method, RNAi method, or the like. 
Non limiting examples of plants that can be transformed 
include: sesame, rice, Forsythia suspenta, tabaco, Arabidopsis 
thaliana, bird's-foot trefoil, barley, wheat, rapeseed, potatoes, 
tomatoes, poplar, bananas, eucalyptus, sweet potatoes, soy, 
alfalfa, lupin, corns, cauliflower, roses, chrysanthemum, 
carnation. Antirrhinum majus, cyclamen, orchid, Eustome 
russellianuniy freesia, gerbera, gladiolus. Baby's Breath 
{Grypsophila elegans)y kalanchoe, lily, pelargonium graveolen^ 
geranium, petunia, torenia, tulips, and the like. 

The present invention provides a protein producing 
method including the step of incubating or breeding a 
transformant described in Section (5) above, and the step of 
obtaining from the transformant a protein that catalyzes the 
biosynthesis of piperitol and/or sesamin. The method allows 
for easy and mass production of the protein that catalyzes the 
biosynthesis of piperitol and/or sesamin. The method only 
requires the foregoing steps, and other conditions are not 
particularly limited and may be suitably set (for example, type 
of host, materials used, and settings). 

A gene and protein according to the present invention 
may be used in a producing method of piperitol and/ or 
sesamin in which piperitol and/ or sesamin are biosynthesized. 
The method allows for easy and mass, production of piperitol 
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and/or sesamin, and therefore may be used in manufacture of 
food (health food in particular) and pharmaceuticals 
containing piperitol and/ or sesamin. Note that, as used 
herein **using a gene or protein" means use of a gene or 
protein under various conditions, including in vivo, in vitro, 
and ex vivo. In one example of a producing method of piperitol 
and/ or sesamin, a gene is transferred into a plant cell, and 
piperitol and/or sesamin are biosynthesized in the 
transformant plant in vivo. In another example, a protein of 
the present invention is allowed to react with a precursor of 
piperitol or sesamin ex vivo for the biosynthesis of piperitol 
and/ or sesamin. Further, piperitol and/ or sesamin may be 
produced using a bioreactor, for example. It should be noted 
here that the gene itself does not possess the enzyme activity 
for the biosynthesis. It is therefore preferable that the gene be 
used in the form of a protein it encodes. 

The present invention also provides a producing method 
of a transformant with a high lignan content or low lignan 
content, using genes of the present invention. The method can 
be used to produce a transformant with a controlled lignan 
level, both easily and conveniently. The lignan may be 
extracted, or the transformant itself may be used as a food or 
pharmaceutical, or as a precursor of these products. 

As used herein, ''plants with a high piperitol and/ or 
sesamin content* means plants containing a large amount of 
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piperitol and/or sesamin. As used herein, **plants with a low 
piperitol and/or sesamin content" means plants containing a 
small amount of piperitol and/ or sesamin. 

A gene according to the present invention may be used 
in a method of cultivating sesame. For example, the method 
can be used to transfer a gene of the present invention into 
sesame and cultivate sesame containing a large amount of 
piperitol and/ or sesamin. The transformant sesame produces 
seeds that contain lipids with a high piperitol and/ or sesamin 
level, which can then be collected. 

While the invention is susceptible to various 
modifications and alternative forms, a specific embodiment 
thereof will be described below in more detail by way of 
Examples with reference to the attached drawings. It should 
be understood, however, that it is not intended to limit the 
invention to the particular forms disclosed, but on the 
contrary, the invention is to cover all modifications, 
equivalents, and alternatives falling within the scope of the 
invention as defined in the appended claims. 

[Examples] 

The invention is now described in detail based on the 
following Examples. Unless otherwise noted, the molecular 
biological techniques used in these Examples are based on 
Molecular Cloning (Sambrook et al.). 

[Example 1: Preparation of lignan from Forsythia 
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intermedia] 

Sixty grams of dried Forsythia intermedia leaves was 
boiled in 3 L water for 30 minutes, and the extract was 
filtered with a filter (circle; 300 mm; Tokyo roshi Kaisha). 
After reducing the solvent volume of a filtrate dispensed with 
an evaporator, the sample was freeze-dried (30 /N), and an 
extracted freeze-dried sample was obtained (21.6837 g). 

Seven grams of the extracted freeze-dried sample was 
re-dissolved in 100 ml water and was homogenized by 
ultrasonic wave. Then, a column filled with 400 ml Diaion 
HP-20 resin (Mitsubishi Chemical Corporation) was washed 
with 800 ml of 50% acetone and equilibrated with 2 L water. 
Using the column, the homogenized sample was crudely 
purified. Specifically, after loaded with the sample, the 
column was washed with 800 ml water, followed by first 
elution with 800 ml of 50% methanol, and second elution with 
800 ml of 100% methanol. 

Each sample obtained from the first elution and second 
elution was analyzed by HPLC under the following conditions 
to determine the presence or absence of lignan. For the 
mobile phase, solution A (0.1% trifluoroacetic acid (TFA)) and 
solution B (0.1% TFA, 90% acetonitrile) were used. For the 
column, Develosil C30-UG-5 (Nomura Chemical Co., Ltd., 4.6 
mm X 150 mm) was used. After equilibrating the column with 
a mixture containing 60% solution A and 40% solution B (20 
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minutes), the sample was eluted for 15 minutes at a flow rate 
of 0.6 ml/ minute through a linear gradient of 60% solution A 
and 40% solution B to 10% solution A and 90% solution B. 
The sample was further eluted for another 10 minutes with 
10% solution A and 90% solution B. Proteins were detected at 
an absorption wavelength of 287 nm and were fractionated 
every minute (crude fractionation analysis). Using the 
SPD-IOAV (Shimazu Corporation), each fraction was measured 
in a spectral range of 220 nm to 400 nm to search for a 
substance with two absorption maxima, which are 
characteristic absorption patterns of lignan (230 nm and 280 
nm). The result of measurement showed that a substance with 
the characteristic absorption maxima of lignan was contained 
mainly in the second elute. 

To separate lignan, fractionation was carried out under 
the following conditions (main fractionation). The second 
elute was concentrated with an evaporator, and was 
freeze-dried (dry weight 1.0325 g). One gram of the extracted 
freeze-dried sample was dissolved first in 1 ml dimethyl 
sulfoxide (DMSO), and then in 6 ml of 30% solution B, using 
sonication both times. The elute was separated by 
centrifugation at 15000 rpm, and about 6 ml of the 
supernatant was loaded onto a column (Develosil 
ODS-UG-15/30; C-18 (Nomura Chemical Co., Ltd., 50 mm x 
500 mm)). For the mobile phase, solution A (0.05% TFA) and 
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solution B (0.05% TFA, 90% acetonitrile) were used. After 
equilibrating the column for 20 minutes at a flow rate of 32 
ml/ minute with 70% solution A and 30% solution B, the 
sample was loaded onto the column and was eluted for 60 
minutes with a linear gradient of 70% solution A and 30% 
solution B to 10% solution A and 90% solution B. The sample 
was further eluted for 30 minutes with 10% solution A and 
90% solution B. Detection was made at an absorption 
wavelength of 280 nm, and the elute was fractionated every 
minute (32 ml) (detector: 11 SUV detector, the Gilson product). 
At the end of the procedure, eight fractions with an 
absorption peak at 280 nm were obtained. These fractions 
were concentrated with an evaporator, and were freeze-dried. 
Each fraction was analyzed by HPLC under the same 
conditions as in the crude fractionation analysis. Among 
these fractions with the characteristic spectrum of lignan, 
fractions believed to be pure lignan were analyzed by NMR 
analysis so as to identify pinoresinol. As a result, 49.7 mg of 
pinoresinol was obtained. 

[Example 2: lignan analysis in sesame seeds] 
Immature sesame seeds were collected out of the sheath 
of cultivated sesame, and were freeze-dried to extract lignans 
with acetone. The following describes a preparation method of 
acetone extract. 

About 100 mg of freeze-dried sample of crushed sesame 
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seeds was dissolved in 1 ml of acetone to obtain an acetone 
extract. 100 pi of acetone extract was dried and re-dissolved 
in 20 ]il DMSO, followed by addition of 80 pi of 50% 
acetonitrile containing 0.1% TFA. The solution was filtered 
with the Millex-LH filter (Millipore Corporation, 0.45 pm/4 
mm) to prepare a sample for HPLC analysis. The result of 
HPLC analysis confirmed the presence of lignans, with their 
retention times matching the retention times of piperitol (a 
standard sesame lignan (control)) (retention time of about 12 
minutes), sesamin (retention time of about 16 minutes), and 
sesamolin (retention time of about 18 minutes). 

The sesame seeds were divided into four stages 
according to their growth stages. 

Stage 1: Seed sheath no longer than 1.5 cm 
Stage 2: Seed sheath between 1.5 cm and 2 cm 
Stage 3: Seed sheath 2 cm or greater, yellowish green 
sheath 

Stage 4: Seed sheath 2 cm or greater, dark green 
sheath 

The result of HPLC analysis under these conditions 
confirmed accumulation of piperitol, sesamin, and sesamolin 
in Stage 3 and Stage 4. ^ 

That is, the result indicates the presence of an enzyme, 
and a gene encoding it, involved in the production of piperitol 
and sesamin in the sesame used in this Example. 
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[Example 3: preparation of sesame cDNA library] 

From the sesame seeds used in Example 2, total RNA 
was extracted with the RNeasy Plant Mini Kit (QIAGEN) 
according to the method recommended by the manufacturer. 
From the extracted RNA, 5 p.g of poly A(+) RNA was obtained 
using the oligotex-MAG mRNA purification kit (TaKaRa). Using 
the poly A(+) RNA as a template, cDNA library was 
constructed with the ZAP Express cDNA Synthesis Kit and 
ZAP Express cDNA GigapackS Gold Cloning Kit (Stratagene) 
according to the method recommended by the manufacturer. The 
cDNA library was obtained in 1 x 10''' pfu/ml. 

[Example 4: cloning of sesame-derived lignan 
biosynthetic gene] 

About 300,000 pfu of cDNA library were screened. As 
probes, the sequences of cytochrome P450 gene of Arabidopsis 
thaliana of a known genome sequence were used. 

Specifically, a superfamily of cytochrome P450 genes of 
Arabidopsis thaliana was phylogenetically classified based on 
the primary sequences (see Fig. 3), and 13 kinds of 
Arabidopsis thaliana cytochrome P450 genes (CYP90A, 
CYP72B, CYP71B, CYP84A, CYP96A, CYP710A, CYP86A, 
CYP74, CYP75B, CYP79F, CYP81D, CYP705A, and CYP83A) 
were used as probes for screening the sesame seed library. 
The DNA was amplified with primers of SEQ ID NOs: 5 to 30 
to introduce DIG-label into the probes. The PGR was carried 
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out as follows. After extracting total RNA from Arabidopsis 
thaliana with the RNeasy Plant Mini Kit (QIAGEN), cDNA was 
obtained by reverse transcription using 1 pg of total RNA as a 
template. The synthesis of cDNA was carried out with the 
Superscripf^M First-Strand Synthesis System for RT-PCR 
(Invitrogen) under the conditions recommended by the 
manufacturer. The reaction mixture of PGR (50 pi) contained 
1 pi of the Arabidopsis genomic DNA, 1 x Taq buffer (TaKaRa), 
0.2 mM dNTPs, primers (SEQ ID NOs: 5 to 30, 0.4 pmol/pl 
each), and 2.5 U rTaq polymerase. The reaction was carried 
out at 94°G for 5 minutes, and then in 28 cycles at 94**C for 1 
minute, at 53°G for 1 minute, and at 72°C for 2 minutes. The 
probes were DIG-labeled under the same PGR condition. 
Screening and the detection of positive clones were carried 
out with the DIG-DNA labeling Ss detection kit (Roche). 

The detection of positive clones was carried out under 
reduced stringent hybridization conditions, based mainly on 
the method recommended by the manufacturer. Specifically, 
after 2 -hour pre-hybridization of membrane at 37**G using a 
hybridization buffer containing 5 x SSG, 30% formaldehyde, 
50 mM sodium phosphate buffer (pH 7.0), 1% SDS, 2% 
blocking reagent (Roche), 0.1% lauroyl sarcosine, and 80 
lig/ml sperm DNA of salmon, DIG-labeled mix probes were 
added in the buffer and the membrane was incubated 
overnight. The membrane was washed at 58*^C for 1 hour in a 
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5 X SSC washing solution containing 1% SDS. 

After the second screening, 96 independent positive 
clones were obtained. The 96 clones were inserted in the 
pBK-CMV plasmid (Stratagene) according to the method 
recommended by the manufacturer, and the nucleotide 
sequences at the 5' end and 3' end of the insert region were 
determined with M13RV primer and M13M4(-20) primer, 
respectively. As a result, 46 clones had sequences similar to 
the sequences of the cytochrome P450. The entire nucleotide 
sequence of the insert region was determined. The resulting 
clones were classified into 5 independent P450 homologs 
(Sesamum indicum P450; SiP) SiP168, SiP189, SiP236, SiP249, 
and SiP288. Thereafter, RT-PCR was carried out using 
primers (SEQ ID NOs: 31 to 40) specific to these five SiP 
genes. As a template, cDNA obtained by reverse transcription 
of RNA prepared from the leaves and seeds of sesame was 
used. The result showed that the five SiP genes were 
expressed in the seeds. The reaction mixture of PGR (25 ^il) 
used in RT-PCR contained 1 pi of each cDNA, 1 x Ex-Taq 
buffer (TaKaRa), 0.2 mM dNTPs, primers (0.2 pmol/pil each), 
and 1.25 U Ex-Taq polymerase. The reaction was carried out 
at 94°C for 3 minutes, and then in 26 cycles at 94°C for 1 
minute, at 53°C for 1 minute, and at 72°C for 2 minutes. 
Ribosomal 18SRNA (AJ236041) was used as a sesame internal 
control gene to compare expression levels. Primers of SEQ ID 
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NOs: 3 and 4 were used for amplification. 

[Example 5: generation of transformants including 
expression vector for the Sip gene] 

Among the 5 SiP genes, SiP249 (pSPB2031) and SiP288 
(pSPB2034) were believed to encode the full open reading 
frame (SEQ ID NOs: 53 to 56). A 1.8 kb cDNA fragment 
containing the SiP249 ORF obtained by digesting pSPB2031 
with BamHI and Xhol was inserted at the BamHI and Sail 
sites of the yeast expression vector pYE22m (Holton, T. A et 
al.. Nature 366, 276-279, 1993). As a result, pSPB2046 was 
obtained. The multiple cloning site of the yeast expression 
vector pYE22m is flanked by the glyceraldehydes-3-phosphate 
dehydrogenase gene (GAPDH) promoter and the GAPDH 
terminator, and the insert inserted in the multiple cloning 
site is constitutively expressed in yeasts under the regulation 
of the GAPDH promoter. The selection marker for the vector is 
tryptophan. Meanwhile, a 1.8 kb cDNA fragment containing 
the SiP288 obtained by digesting pSPB2034 with BamHI and 
Xhol was inserted at the BamHI and Sail sites of the yeast 
expression vector pYE22m. As a result, pSPB2047 was 
obtained. The two kinds of yeast expression vector were used 
to transform the yeast INVsc strain (Invitrogen) according to 
ordinary method, so as to obtain INVsc/pYE22m/SiP249 and 
INVsc/pYE22m/SiP288. 

The genes of SiP168, SiP189, and.SiP236 had incomplete 
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open reading frames. In order to obtain sequences with 
complete open reading frames (SEQ ID NOs: 1 and 2, and 57 
to 60), the 5' end of each gene was amplified with the 
GeneRacer kit (Invitrogen) according to the method 
recommended by the manufacturer. Primers of SEQ ID NOs: 
41 to 46 were used for amplification. The sequences amplified 
the 5' end of each SiP gene were determined. 

In order to amplify the full length open reading frames of 
these 3 kinds of SiP genes, the genes were amplified by PGR. 
As a template, cDNA derived from sesame seeds was used. The 
primers used the PGR had restriction enzyme sites as noted 
below. The PGR was carried out as follows. The reaction 
mixture of PGR (50 pi) contained 1 ]il of template cDNA 
derived from sesame seed, 1 x KOD plus buffer (TOYOBO), 0.2 
mM dNTPs, primers (SEQ ID NOs: 47 to 52, 0,4 pmol/vil each), 
ImM MgS04, and 1 U KOD plus DNA polymerase. The reaction 
was carried out at 94°G for 5 minutes, and then in 30 cycles 
at 94**G for 1 minute, at 55°G for 1 minute, and at 72°G for 2 
minutes. The amplified fragment containing the full length 
SiP was inserted in the multiple cloning site of the pCR-blunt 
II TOPO vector (Invitrogen), so as to obtain TOPO-SiP168 
(pSPB2064), TOPO-SiP189 (pSPB2055), and TOPO-SiP236 
(pSPB2048). 

A DNA fragment containing cDNA of about 1.5 kb 
obtained by digesting the restriction enzyme sites of the 
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primers used in the amplification of pSPB2064, pSPB2055, 
and pSPB2048 was inserted at the BamHI and Sail sites of 
the yeast expression vector pYE22m. As a result, 
pYE22m/SiP168 (pSPB2052), pYE22m/SiP189 (pSPB2053), 
and pYE22m/SiP236 (pSPB2049) were obtained. 

The three kinds of yeast expression vectors were used to 
transform the yeast INVsc strain (Invitrogen), so as to obtain 
INVsc/pYE22m/SiP168, INVsc/pYE22m/SiP189, and 

INVsc/pYE22m/SiP236. 

[Example 6: biosynthesis of sesame lignan in the 
transformants] 

In addition to the INVsc/pYE22m/SiP249 and 
INVsc/ pYE22m/SiP288, the three transformants 

INVsc/pYE22m/SiP168, INVsc/pYE22m/SiP189, and 

INVsc/ pYE22m/SiP236 were incubated at 30°C for 36 hours 
in a 400 mL YNBDglc medium (0,67% yeast nitrogen base, 2% 
glucose, and 20 mg/L of various amino acids excluding 
tryptophan). From the culture solutions of the yeast 
transformants, microsome fractions were collected by a known 
ultracentrifugation method (Holton, T. A et al.. Nature 366, 
276-279, 1993). 

The microsome sediments were suspended in 1 ml of 
suspension buffer (O.IM potassium phosphate buffer (pH 7.4), 
20% glycerol, 0.3 |il/ml mercaptoethanol) to obtain a 
microsome solution. To 240 |il of the microsome solution was 
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added 30 tjI of IM potassium phosphate buffer (pH 7.4), 6 \il 
of SOmM NADPH, and 24 \il of 267 i^M pinoresinol or piperitol. 
The mixture was reacted at 30°C for 1 hour. To the reaction 
mixture of enzyme, an equal amount of 100% acetonitrile 
containing 0.1% TFA was added (50% final concentration). 
The mixture was centrifuged at 15000 rpm at 4**C for 3 
minutes, and 150 pi of supernatant was purified with a 
Millex-LH filter (Millipore Corporation, 0.45 iim/4 mm). The 
purified sample was then analyzed by HPLC under the same 
conditions as in the crude fractionation analysis described in 
Example 1. 

The result of HPLC analysis for INVsc/pYE22m/SiP189 
is described below with reference to Fig. 2(a) through Fig. 2(e). 
In the INVsc/pYE22m/SiP189, the pinoresinol (Fig. 2(a), 
retention time of about 8 minutes) converted into two 
products with a lignan-like absorption spectrum with 
retention times of about 12 minutes and about 16 minutes, 
respectively (Fig. 2(b)), Further, in the INVsc/pYE22m/SiP189, 
the piperitol (Fig. 2(d), retention time of about 12 minutes) 
was converted into a product with a lignan-like absorption 
spectrum with a retention time of about 16 minutes (Fig. 2(e)). 
From these retention times, they were considered to be 
piperitol (retention time of about 12 minutes) and sesamin 
(retention time of about 16 minutes), respectively. 

By the LC-MS/MS analysis (LC: Waters 2790, the Waters 
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product; MS: QUATRO micro, the Micromass product), the two 
peaks (retention times of about 12 minutes and 16 minutes) 
were compared with a standard based on their molecular 
weights and fragment patterns. The result showed that their 
molecular weights matched that of the standard, thereby 
identifying the two SiP189 products as piperitol and sesamin. 
The LC-MS/MS analysis was carried out under the following 
conditions. For LC, the Develosil C30-UG-5 (Nomura Chemical 
Co., Ltd., 2.0 X 50 mni) was used. As the mobile phase, 
solution A (H2O), solution B (methanol), and solution C (10 
mM CH3COONH4) were used. The flow rate was 0.25ml/min 
with 10% solution C. Under these conditions, piperitol and 
sesamin were detected after 8.4 minutes and 10.1 minutes. 
MS was carried out in a POSITIVE measurement mode. It was 
found as a result that SiP189 encodes an enzyme that 
synthesizes sesamin from pinoresinol via piperitol. Initially, it 
was believed that different enzymes were involved in the 
synthesis of piperitol from pinoresinol, and the synthesis of 
sesamin from piperitol. The present invention proved the 
contrary, showing that a single enzyme encoded by a gene of 
the present invention is involved in the both reactions. 

From the result of analysis using the reaction system 
that did not contain NADPH in the reaction mixture of enzyme, 
it was found that the activity of INVsc/pYE22m/SiP189 to 
produce piperitol was dependent on NADPH (Figs. 2(b) and 
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2(C)). The same was the case for sesamin (Figs. 2(e) and 2(f)). 
In the absence of NADPH, the activities for producing piperitol 
and sesamin dropped to about 14% and about 18%, 
respectively, from the levels in the presence of NADPH. 

Thereafter, the microsome fraction of the 
INVsc/pYE22m/SiP189 was reduced by CO, and its 
absorption spectrum was measured with a spectrophotometer 
(the Hitachi product U-3000P Spectrophotometer). The result 
showed that the microsome fraction had absorption at 450 nm, 
compared with the transformant yeast INVsc/pYE22m used as 
a control. The result therefore confirmed production of 
cytochrome P450 protein in the microsome fraction of the 
INVsc/pYE22m/SiP189. 

According to the method described in Example 2, RNA 
was extracted from the sesame seeds separated into 4 stages 
according to their growth in Example 2. RT-PCR was carried 
out using a primer set (SEQ ID NOs: 49 and 50) for amplifying 
SiP189, and a primer set (SEQ ID NOs: 3 and 4) for amplifying 
SilSSrRNA. The reaction mixture of PGR (25 ill) contained 1 
111 of each cDNA, 1 ^ Ex-Taq buffer (TaKaRa), 0.2 mM dNTPs, 
primers (0.2 pmol/iil each), and 1.25 U Ex-Taq polymerase. 
The reaction was carried out at 94°C for 5 minutes, and then 
in 26 cycles at 94°C for 1 minute, at 53'*C for 1 minute, and 
at 72'*C for 2 minutes. The result confirmed strong expression 
of SiP189 in Stage 4, in which accumulation of sesamin in the 
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seed becomes notable. From the fact that the growth-stage 
dependent lignan accumulation coincides with the time of 
SiP189 gene expression, the SiP189 gene was found to encode 
the enzyme for producing piperitol and sesamin in the sesame 
seeds. 

The foregoing results show that the SiP189 gene encodes 
cytochrome P450 that catalyzes the reaction producing 
piperitol from pinoresinol, and the reaction producing 
sesamin from piperitol. In Fig. 3, SiP189 is indicated by arrow. 
It can be seen from Fig. 3 that SiP189 belongs to the CYP81 
family in the cytochrome P450 superfamily. 

The gene enables synthesis of sesamin and piperitol 
using various organisms including sesame and other plants, 
or a system such as a bioreactor. 

[Example 7: genomic analysis of SiP189 gene in 
cultivated sesame Sesamum indicum 

In order to find the number of copies of SiP189 gene in 
the S. indicum genome, a genomic Southern analysis was 
carried out. 

Using the Nucleon Phytopure for Plant Extraction Kit 
(Amersham), genomic DNA was extracted from the leaves of S. 
indicum (cultivar Masekin) according to the method 
recommended by the manufacturer. Ten micrograms of the 
genomic DNA so extracted was completely digested by three 
kinds of restriction enzymes EcoRI, Ncol, and Xbal, and each 
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sample was separated by electrophoresis on an agarose gel. 
The agarose gel was hydrolyzed for 15 minutes using 0.25 M 
HCl, denatured for 30 minutes using a solution containing 
1.5 M NaCl and 0.5 M NaOH, and neutralized with a solution 
containing 1.5 M NaCl and Tris-HCl (pH 7.5). The genomic 
DNA was then transferred onto a membrane (Hybribond-N, 
Amersham) in 20 x SSC, and was bound to the membrane by 
irradiation of ultraviolet light. The membrane was subjected 
to pre-hybridization at 42**C for 1 hour in a hybridization 
buffer (high SDS buffer) containing 7% SDS, 50% formamide, 
5 X SSC, 2% blocking reagent, 0.1% lauroyl sarcosine, and 50 
mM sodium phosphate buffer (pH 7.0). 

As a hybridization probe, an ORF region of about 900 bp 
starting from a start methionine of the cDNA of the SiP189 
was used. This region of the cDNA was DIG-labeled by PCR, 
using primers of SEQ ID NO: 61 (Bam-SST-FW2) and SEQ ID 
NO: 62 (SiP189-Nco-RV). The reaction mixture of PCR 
contained 1 ng of plasmid (pSPB2055) containing the cDNA of 
SiP189, 1 X PCR buffer, 1 x DIG-dNTP mixture (PCR DIG 
Labeling Mix, Roche), 0.2 pmol/]il of each primer, and lU 
rTaq polymerase (TAKARA BIO INC.). The PCR was carried out 
in 30 cycles at 95*^0 for 30 seconds, at 53''C for 30 seconds, 
and at 72**C for 1 minute. 

The PCR product was purified with the Sephadex G-50 
quick spin column (Boehringer), and was heat denatured and 
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immediately placed on ice. As a hybridization probe, 10 |il of 
the denatured product was added to a prehybridization 
solution, and the mixture was incubated overnight at 42°C. 

The membrane was washed twice at 65°C for 30 minutes 
in a 0.2 x SSC solution containing 0.1% SDS (high stringent 
hybridization condition). Hybridization signals were obtained 
with the DIG-labeling 85 detection kit (Roche) according to the 
DIG application manual (Roche). 

Fig. 5 shows the result of detection. As shown in Fig. 5, 
the SiP189 gene was detected as a single band for all of the 
three restriction enzyme treatments. The result indicates that 
the SiP189 gene exists as a single gene in the S. indicum 
genome, and that no other genes in the genome have strong 
homology to the SiP189 gene. Therefore, it can be said that 
the catalytic activity for synthesizing piperitol and sesamin in 
sesame plants is conferred by the SiPlSQ gene. 

[Example 8: isolation of SiP189-like gene from Sesamum 
radiatum] 

Sesamum radiatum is a sesame plant found in Africa. A 
cytogenetic analysis has revealed that the number of 
chromosomes in this particular species of sesame plant was 
2n - 64, indicating a cytogenetically different lineage from the 
S. indicum (2n = 26) (Mitsuo Namiki, Teisaku Kobayashi, 
Science of sesame, Asakura Shoten). A lignan content has 
also been analyzed in the seeds of S. radiatum, and 
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accumulation of sesamin has been reported (Bedigian, D., et 
al. Biochemical Systematics and Ecology 13, 133-139, 1985). 
This suggests that S. radiatum should have a gene (SrSiP189) 
encoding an enzyme corresponding to the SiP189 of S. indicum. 
In addition, it is expected that the sequence of the SrSiP189 
gene of the S. radiatum is highly homologous to the sequence 
of SiP189. 

According to the procedure of Example 4, cDNA was 
prepared from the seeds of S. radiatum. Using 1 yil of the 
cDNA as a template, RT-PCR was carried out according to the 
method of Example 4, using primers of SEQ ID No: 61 
(Bam-SST-FW2) and SEQ ID NO: 63 (GR-SST-RVl). The 
primers were designed according to the sequence of SiP189, 
so as to amplify fragments probably containing a full length 
ORF. The RT-PCR produced a fragment of about 1,5 kb, which 
was believed to contain SrSiP189. The fragment was inserted 
in the pCR-blunt II TOPO vector (Invitrogen), and pSPB2068 
was obtained. The entire nucleotide sequence of the inserted 
fragment was determined. The result showed 96% sequence 
homology on the DNA level, and 95% sequence homology on 
the amino acid level, as compared with SiP189 derived from S. 
indicum (SEQ ID NO: 64 shows the amino acid sequence of 
SrSiP189, and SEQ ID NO: 65 shows the nucleotide sequence 
of SrSiP189). According to the procedure of Example 4, 
RT-PCR was carried out using the primers (SEQ ID NOs: 61 
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and 63). As a template, cDNA prepared from the seeds and 
leaves of S, radiatum was used. The result found strong 
SrSiP189 expression in seeds but hardly any in leaves. The 
analysis of SrSiP189 expression by RT-PCR therefore 
indicated that SrSiP189 was functional in seeds. 

[Example 9: functional analysis of SrSiP189 gene in 
Sesamum radiatum] 

In order to determine the biochemical function of 
SrSiP189, a recombinant SrSiP189 protein was expressed in 
yeasts, and the activity of the recombinant SrSiP189 protein 
for the biosynthesis of lignan was examined. First, pSPB2068 
was digested with restriction enzymes BamHI and Xhol, and a 
resulting fragment of about 1.5 bp containing cDNA with a 
full length SrSiP189 was inserted at the BamHI site and Sail 
site of a yeast expression vector pYE22m. As a result, 
pSPB2069 was obtained. According to the procedure of 
Example 6, a microsome was prepared from the yeast 
transformants, and biosynthesis activities for lignan were 
measured. Fig. 6 shows a result of HPLC analysis. As shown 
in Fig. 6, The recombinant SrSiP189 protein had a NADPH 
dependent catalytic activity for converting pinoresinol into 
piperitol, and piperitol into sesamin, as with the SiP189 
derived from S. indicum (Fig. 6(a) and Fig. 6(b)), In an enzyme 
reaction mixture containing no NADPH, the catalytic activities 
for piperitol and sesamin dropped to 16.9% and 8.4%, 
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respectively (Fig. 6(c) and Fig. 6(d)). The result therefore 
showed that the SrSiP189 was indeed a counterpart gene of 
the SiP189 in S. indicum. 

The foregoing result confirmed that a gene with an 
SiP189-like sequence is found across species, and that the 
gene encodes an enzyme that catalyzes the reaction 
converting pinoresinol into piperitol, and piperitol into 
sesamin. 

The invention being thus described, it will be obvious 
that the same way may be varied in many ways. Such 
variations are not to be regarded as a departure from the 
spirit and scope of the invention, and all such modifications 
as would be obvious to one skilled in the art are intended to 
be included within the scope of the following claims. 

[Example 10: functional analysis of SiPlSQ protein in 
plant cells] 

In order to confirm the biological functions of SiP189 
protein in plant cells, tabaco {N. tabaccum) was transformed 
with the SiP189 gene. 

With the restriction enzymes BamHI and Xhol, 
pSPB2055 including SiP189 was digested. A resulting DNA 
fragment of about 1.5 kb including ORF of the SiP189 was 
ligated with the BamHI site and Sail site of a plant 
transforming binary vector pSPB176, so as to obtain a binary 
vector pSPB2057. The multiple cloning site of the pSPB176 is 
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flanked by the CaMVSSS promoter and the NOS terminator. 
The insert inserted in these sites is constitutively expressed 
in excess in the plant cells under the control of CaMVSSS 
promoter. 

The pSPB2057 was transformed into agrobacterium 
(strain: Agio) according to a conventional method (Shimonishi 
et al.. New Introductions to Biological and Chemical 
Experiments 3, Kagaku Doujin (pp. 122-124)). The 
agrobacterium transformant was used to infect a tabaco leaf 
disk. 

From the leaves of thirteen lines of the transformants, 
cDNA was prepared according to the method of Example 2. 
Then, RT-PCR was carried out according to the method of 
Example 4, using primers of SEQ ID NOs: 49 and 50. A 
ubiquitin gene of tabaco (NtUBQ accession No: U66264) was 
amplified as an internal control gene, using primers 
consisting of the nucleotide sequences of SEQ ID NOs: 66 and 
67 (NtUBQ-FW, NtUBQ-RW, respectively). The result 
confirmed the presence of . highly-expressed SiP189 gene in 
lines 6, 7, and 12 (Fig. 7). 

The following procedure was carried out on ice or at 4**C, 
About 15 g of each sample of transformant leaves (lines 6, 7, 
and 12) and non-transformant leaves was crushed with a 
pestle in liquid nitrogen, and was dissolved in 30 ml of 
homogenize buffer (0.1 M potassium phosphate buffer (pH 
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7.0), 0.5 M mannitol, 5 mM EDTA, 42 mM mercaptoethanol, 
50 mM sodium ascorbate, 0.1% BSA, 1 mM PMSF, and 1% 
PVPP). 

The mixture was centrifuged at 10000 x g for 20 minutes, 
and the supernatant was filtered with a Miracloth. The filtrate 
was ultracentrifuged at 100000 x g for 90 minutes to obtain a 
crudely extracted microsome fraction. The microsome fraction 
(240 \xl) was subjected to the enzymatic reaction with piperitol 
according to the method of Example 4, and the resulting 
product was analyzed by HPLC. 

The result of HPLC showed that the tabaco-derived 
microsome expressing SiP189 in excess had a peak which was 
not observed in the non-transformant, and was dependent on 
the presence of NADPH in the reaction mixture of enzyme. The 
peak coincided with the retention time of a sesamin standard, 
indicating the function of the SiP189 as a protein with the 
catalytic activity for the biosynthesis of sesamin in the plant 
cell (Fig. 8), 

[Example il: identification of expression regulatory 
region in SiP189 gene] 

To get the insight of the transcriptional regulation of 
SiP189 gene, a 5' non-coding region of the SiP189 gene was 
isolated, sequenced and analyzed. From the genomic DNA of 
sesame (S. indicum), a genome library was constructed using 
the XBlueSTARTM Vector system (NOVAGEN). 
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200 pg of sesame genomic DNA was partially digested 
with the restriction enzyme SauSAI to obtain fragments of 
about 20 kb. The DNA fragments were subjected to a sucrose 
density gradient centrifugation (10% to 40%) at 25000 rpm, at 
10°C for 24 hours (SW28 rotor, Beckman). The centrifuged 
samples were fractionated (1 ml each) using the AUTOMATIC 
LIQUID CHARGER (Advantec), and the Micro Tube Pump 
(EYELA). For each fraction, the size of fragments was 
confirmed by pulse-field gel electrophoresis. The pulse-field 
gel electrophoresis used a gel with 1% Agarose NA (Amersham 
biosceience) and 0.5 x TBE, and was carried out in a 0.5 x 
TBE buffer for 120°/ 1 sec. to 1 sec. at 6V/cm (CHEF MAPPER, 
Invitrogen). With the fractions containing fragments with an 
average fragment size of about 20 kb, a genome library was 
constructed according to the method recommended by the 
manufacturer. The library had a titer 1.5 x lO^ pfu/500 pi. 
The genome library (500000 clones) was screened with a 
probe, for which about 850 bp ORF region of the SiP189 gene 
amplified with primers consisting of base sequences of SEQ 
ID NOs: 68 and 69 (SiP189-bam-FW and SiP189-nco-RV, 
respectively) was used. 

The probed was labeled and detected with the AlkaPhos 
Direct Labeling and Detection system (Amersham bioscience) 
according to the method recommended by the manufacturer. 
Hybridization was carried out under the following conditions: 
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Probe: 



5 ng/ml hybridization buffer 



Pr ehybr idization : 



55°C for 1 hour 



Hybridization : 



SS'^C overnight 



Washing: 



twice at SS^'C for 30 minutes 



After the third screening, nine kinds of positive clones 
were isolated, and a gSiP189-#6 with an insert size of about 
12 kb was obtained therefrom. 

Then, PGR was carried out using primers of nucleotide 
sequences of SEQ ID NOs: 68 and 69, and phage arm primers 
STAR-LFl (SEQ ID NO: 70) and STAR-LRl (SEQ ID NO: 71), so 
as to determine the direction and position of the probe 
inserted in the gSiP189-#6. 

The PGR analysis indicated gSiP189-#6 contains 5* 
non-coding region of SiP189 gene, no shorter than 5kb. Based 
of this result, the entire nucleotide sequence of the 
gSiP189-#6 was determined. 

The inserts were amplified by LA-PCR, using the 
gSiP189-#6 as a template. The reaction mixture of LA-PCR 
contained 1 pi of positive clone SM buffer suspension liquid, 
1 X LA buffer (TaKaRa), primers (Ipmol/pl each), 0.4 mM 
dNTP, 2 mM MgCb, and 2.5 U LA-Taq polymerase. The 
LA-PCR was carried out at 96°C for 5 minutes, and then in 30 
cycles at 98''C for 10 seconds, at 55**C for 10 seconds, and at 
68°C for 10 minutes. Finally, the product was maintained at 
72**C for 15 minutes. 
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The resulting fragments were physically cut, and DNA 
fragments of about 1 to 2 kb were fractionated. The fragments 
were end-blunted and inserted in the Hindi site of the 
pUCllS (TaKaRa) to construct a shot gun library. The library 
had a titer 2,8 x lo^ cfu/vil. 

With the shot gun library derived from gSiP189-#6, E. 
coli DHIOB strain (Invitrogen) was transformed by an 
electroporation method. From randomly picked 192 colonies, 
DNA was prepared . using the TempliPhi DNA Sequencing 
Template Amplification kit (Amersham bioscience). The DNA 
so prepared was amplified using M13-47(F) primer (SEQ ID 
NO: 72) and RV-M(R) primer (SEQ ID NO: 73). 

The product of amplification was purified with the Clean 
SEQ (Agecourt), and sequenced using the MegaBASE4000 
(Amersham Bioscience). The sequence data was assembled by 
PHRAP (CAP4), and a CONTIG sequence, including a 
5'-sequence of about 13 kb starting from the start methionine 
site of the SiP189 gene, was obtained. 

In order to identify regulatory cis-elements at the 
5'-region of the SiP189 gene, a 5'-sequence of about 3 kb 
starting from the start methionine site of the SiP189 gene 
(SEQ ID NO: 74) was analyzed by PLACE 
(http://www.dna.affrc.go.jp/PLACE/). The PLACE analysis 
identified a large number of binding sites for a specific 
transcription factor family, along with a large number of 
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regulatory cis-elements that responded to a specific signal. 
This indicates their involvement in the expression of the 
SiP189 gene (Fig. 9 and Fig. 10). 

Thereafter, a regulatory non-coding region of the 
SrSiP189 gene encoding the S. radiafum-derived enzyme that 
catalyzes the synthesis of piperitol and sesamin was isolated. 
Using the genomic DNA of S. radiatum as a template, PGR was 
carried out with primers SEQ ID NOs: 75 and 76 (gSST-FWl 
and gSST-RV2, respectively). The primers were designed based 
on the genomic sequence of SiP189 set forth in SEQ ID NO: 
74. 

The reaction mixture contained 1 |il of genomic DNA (50 
ng), 1 X Ex-Taq buffer (TaKaRa), 0.2 mM dNTPs, primers (0.2 
pmol/vil each), and 1.25 U Ex-Taq polymerase. The PGR was 
carried out at 94''C for 5 minutes, and then in 30 cycles at 
94°C for 1 minutes, at 55**C for 1 minute, and at 72*^0 for 4 
minutes. Finally, the product was maintained at 72°C for 4 
minutes. The resulting fragments were subjected to 
electrophoresis, and a fragment of about 3 kb was obtained. 
The fragment was inserted in the multiple cloning site of the 
pCR-TOPO-XL vector (Invitrogen) according to the method 
recommended by the manufacturer. As a result, pSPB2664 
was obtained. The entire nucleotide sequence of the fragment 
inserted in the pSPB2664 was sequenced by a primer walking 
method. As a result, a fragment of about 2.8 kb at the 
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5'-region of the SrSiP189 gene (SEQ ID NO: 77), a probable 
sequence including the regulatory cis-elements of SrSiP189 
gene, was obtained. The same PLACE analysis was performed 
as for the genomic SiP189 gene. The result identified a large 
number of binding sites for a specific transcription factor 
family, along with a large number of elements that responded 
to a specific signal {Fig. 11). 

The SiP189 gene derived from cultivated sesame (S. 
indicum) and the SrSiP189 gene derived from S. radiatum were 
tested for their sequence identity on the DNA level according 
to the Clustal-W analysis (MacVector ver. 7.2.2, Symantech). 
These genes only had 78% sequence identity in the 
5'-non-coding region of about 3 kb, but had a notably high 
sequence identity (96%) in the ORF region of the genes (Fig. 
12). These results support our findings that SiP189 and 
SrSiP189 are highly conserved in terms of protein function 
but have different expression patterns. In addition to RT-PCR 
analyses, these cis-element analyses support that the two 
lignan biosynthetic genes, SiP189 from S. indicum and 
SrSiP189 from S. radiatum are not under an identical 
transcriptional regulation. 
[Example 12] 

S.alatum is an African wild sesame species that greatly 
differs (also in morphology) from cultivated SAndicum (Namiki 
et al., Science of sesame, Asakura Shoten). The number of 
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chromosomes of S.alatum is the same as S.indicum, 2n = 26, 
and the geographical distribution of S.alatum is in Nigeria, 
Sudan, and Mozambique in Africa. 

With the same method of isolating the SrSiP189 gene 
from the African sesame S. radiatum in Example 8, a 
counterpart gene (SaSiP189) of SiP189 was isolated from S, 
alatum. 

cDNA of Stage 4 of S. alatum was used as a template for 
PGR. A fragment of about 1.5kb, that had been amplified with 
primers of SEQ ID No: 61 (Bam-SST-FW2) and SEQ ID NO: 63 
(GR-SST-RVl), was subcloned to the pCR-blunt II TOPO 
(Invitrogen). The nucleotide sequence of the inserted fragment 
was determined by a primer walking method. SEQ ID NO: 78 
shows the amino acid sequence of SaSiP189, and SEQ ID NO: 
79 shows the nucleotide sequence of SaSiP189. 

The resultant SaSiP189 showed 90% sequence identity 
on the DNA level, and 86% sequence homology on the amino 
acid level, as compared with SiP189. These results show that 
the gene SiP189 of lignan biosynthetic enzyme is a highly 
conserved gene, even in the presence of geographical isolation 
or morphological/ cytogenetic difference. 

The embodiments and concrete examples of 
implementation discussed in the foregoing detailed explanation 
serve solely to illustrate the technical details of the present 
invention, which should not be narrowly interpreted within the 
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limits of such embodiments and concrete examples, but rather 
may be applied in many variations within the spirit of the 
present invention, provided such variations do not exceed the 
scope of the patent claims set forth below. 

INDUSTRIAL APPLICABILITY 

Previous research on sesamin has found various 
physiological activities of the compound, and sesamin is now 
known to possess a wide variety of recuperating effects. The 
present invention identified a gene that encodes an enzyme 
for catalyzing the biosynthesis of piperitol from pinoresinol, 
or of sesamin from piperitol. The sesame-derived cytochrome 
P450 gene (SiP gene) allows for production of sesamin and 
piperitol using a recombinant organism, and therefore 
increases sesamin yield and reduces production cost. 

As described above, the sesame-derived SiP 189 gene and 
SrSiP189 gene of the present invention encode cytochrome 
P450 that catalyzes the biosynthesis of piperitol from 
pinoresinol, and of sesamin from piperitol. Being an 
important food source since ancient times, sesame, including 
its seeds, seed oil, and seed extract, continues to hold its title 
as one of the healthiest foods available. Among many benefits 
offered by sesame, its physiological activities have caught the 
attention of many researchers. The SiP189 gene and SrSiP189 
gene identified by the present invention can be used in the 
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production of sesamin, which conventionally relied solely on 
sesame seeds. The present invention therefore holds great 
promise in increasing sesamin yield. 

With these and other advantages of the present 
5 invention, the invention is useful in agriculture, food industry, 

pharmaceutical industry, and all other industries related to 
these fields. 



